Genes involved in the biosynthetic pathway of carbapenem

ABSTRACT

PCT No. PCT/GB95/01125 Sec. 371 Date Jun. 3, 1997 Sec. 102(e) Date Jun. 3, 1997 PCT Filed May 18, 1995 PCT Pub. No. WO95/32294 PCT Pub. Date Nov. 30, 1995DNA involved in the bacterial gene expression of carbapenem antibiotics comprising: a) at least one of the genes carA, carB, carC, carD, carE, carF, carG, carH, b) DNA capable of hybridizing to any of the genes defined in a) and capable of functioning as such genes in the biosynthetic pathway of a carbapenem, c) DNA which is a) or b) above by virtue of the degeneracy of the genetic code. Polypeptides encoded by such DNA.

This application was filed under 35 U.S.C. 371 from PCT/GB95/01125 filed May 18, 1995.

FIELD OF THE INVENTION

The present invention relates to the bacterial gene expression of carbapenem antibiotics.

BACKGROUND OF THE INVENTION

The carbapenem antibiotics constitute a diverse group of β-lactam antibiotics characterised by potent anti-bacterial and β-lactamase-resistant activity. More than forty different carbapenems are known, most of which are produced by the actinomycetes, particularly Streptomyces spp (Ratcliffe and Albers-Schonberg, 1982; Brown 1984; Williamson 1986; all cited in Baiton et al 1992).

Carbapenems have been isolated from the Gram-negative bacterium Serratia marcescens and Erwinia carotovora by Parker et al. (1982) and in Azospirillum spp UK 1521 by Kintaka et al. (1985); all cited in Bainton et al 1992.

Bainton et al (1992) have recently shown that carbapenem biosynthesis is regulated by the regulatory factor N-(3-oxohexanoyl)-L-homoserine lactone (known as HSL or OHHL). This compound was previously only known for its role in auto-induction of bioluminescence in the marine bacterium Vibrio fischeri. OHHL is also structurally related to the A- and I-factors which are known to regulate production of antibiotics in some Streptomyces species.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 shows the cosmid cWU142 and the organization of the car genes R,A,B,C,D,E,F,G,H.

FIG. 2 shows plasmid pSMG12.

FIG. 3 shows plasmid pSMG13.

FIG. 4 shows the entire nucleotide and predicted amino acid sequence of carR, carA, carB, carD, carE, carF, carG and carH.

DETAILED DESCRIPTION OF THE INVENTION

In order to examine the biosynthetic and regulatory mechanism involved in the production of the β-lactam antibiotic 1-carbapen-2-em-3-carboxylic acid by Erwinia carotovora, blocked mutants were obtained with a carbapenem non-producing phenotype (Car ) as described by Bainton et al (1992). These mutants fell into two distinct groups: group 1 mutants secreted a low molecular mass diffusible factor which restored carbapenem biosynthesis in group 2 mutants, but not vice versa. This factor was shown to be OHHL. Class 1 mutants produced OHHL and were thus thought to be defective in carbapenem biosynthetic genes.

In order to study class 1 mutants a chromosomal DNA cosmid library of Erwinia carotovora strain ATCC 39048 was constructed in Escherichia coli. The cosmids produced were used in standard complementation studies to find a sequence which could restore the carbapenem antibiotic production in the class 1 mutants.

One cosmid (cWU142) was presumed to contain the carbapenem biosynthetic genes. Restriction fragments of this cosmid were sub-cloned and a 3.8 Kb EcoRI fragment was found to complement 7 out of 8 class 1 mutants. This fragment was sequenced and shown to comprise 2 Kb of cosmid DNA and 1.8 Kb of Erwinia DNA. This was extremely unexpected because known antibiotic biosynthetic gene sequences are much longer than 1.8 Kb.

The 1.8 Kb gene sequence was found to encode CarR, a homologue of the LuxR regulatory protein from the Lux operon system associated-with the bioluminescence phenotype of the marine bacterium V. fischeri.

In V. fischeri OHHL synthesis is coded for by the gene luxI. In Erwinia carotovora, OHHL synthesis is encoded by a luxI homologue which has been named carI.

By analogy with the V. fischeri Lux system, the inventors postulated that when OHHL is made, it binds to CarR which can then act as a transcriptional activator of the carbapenem biosynthetic genes. Thus, the inventors reasoned that 7 out of 8 of the class 1 mutants are not, as expected, defective in genes required for synthesis of carbapenem, but in a gene encoding a regulatory protein CarR, needed to switch on the carbapenem biosynthesis genes. Without the carR gene product the carbapenem biosynthetic genes are not expressed, that is, they remain silent or cryptic.

Our co-pending application GB-9311641.6 relates to the carR gene product and to DNA sequences encoding it.

We therefore used direct cosmid complementation of a strain of class 1 mutants (strain PNP 14) to clone a complementing cosmid (cWU142). We presumed that this would carry all the biosynthetic genes. When cWU142 was transferred into other class 1 mutants it complemented all of them. cWU142 did not complement any class 2 mutants (this is because the carI gene is physically separate from the biosynthetic genes). We showed that the cWU142 cosmid carries the gene (carR) encoding the LuxR homologue. The CarR protein acts as a positive activator of the car biosynthetic genes and functions in trans. This carR gene is capable of activation of carbapenem antibiotic production in several other Erwinia strains (approx. 18-20% of random strains). One of the class 1 mutants (PNP21) was not complemented by the carR gene alone and was presumed to be a real biosynthetic mutant. cWU142 complemented this mutant, implying that the car biosynthetic genes were indeed on the cosmid. This was confirmed by the observation that, in E. coli, cWU142 could encode carbapenem synthesis simply by the exogenous addition of chemically-synthesised OHHL. It seems that the car biosynthetic genes are not expressed because the CarR protein requires activation by the OHHL although the requirement for OHHL can be over-ridden by producing CarR in high dosage e.g. by providing the carR gene in trans on a multicopy plasmid. In Erwinia carotovora there is endogenous OHHL so the car genes are switched on.

Thus all genes for antibiotic production and one for regulation (carR) were on cWU142, permitting the carbapenem antibiotic biosynthetic pathway to be reconstituted in a heterologous organism (in this case E. coli).

The cWU142 cosmid was subcloned to produce smaller plasmids which still express the antibiotic. Plasmid pSMG12 encoded antibiotic synthesis in E. coli in the presence of exogenously added OHHL. Plasmid pSMG13 was engineered to contain the carI gene in addition to the genes found in pSMG12. Plasmid pSMG13 encoded carbapenem production without the need for exogenously added OHHL. This strain makes OHHL (from the carI gene product) and apparently acts with CarR to transcriptionally activate the carbapenem biosynthetic genes. Further subcloning of the pSMG13 plasmid has shown that the cluster of 9 genes carR,carA,carB,carC,carD,carE,carF,carG,carH is sufficient to express carbapenem in E. coli if OHHL is added. OHHL is not required if the carI gene is also present and functional. All of the Erwinia DNA in the pSMG12/pSMG13 plasmid has been sequenced to define the genes carR to carH.

According to one aspect of this invention we provide, either separately or in combination, DNA comprising the genes carA,carB,carC,carD,carE,carF,carG,carH.

Preferably, we provide DNA comprising at least the CarA, CarB and CarC genes. More preferably we provide DNA comprising the CarA, CarB, CarC, CarD and CarE genes. Also included within the invention are DNA capable of hybridising to any of the DNAs defined above and capable of functioning as such DNAs in the carbapenem biosynthetic pathway, and DNAs which are equivalent by virtue of the degeneracy of the genetic code.

The CarF and CarG genes are now known to be involved in conferring resistance to the antibiotic produced by the organism.

The DNA may also include either or both of the regulatory genes carI and carR.

The DNA according to the invention may comprise genes of Erwinia carotovora, or DNA capable of hybridising to any of said genes and capable of functioning as such genes in the biosynthetic pathway of a carbapenem.

Such hybridising DNA may comprise native genes from other carbapenem-synthesising organisms or partially or wholly synthetic DNA.

We also provide DNA which is equivalent to any DNA mentioned above by virtue of the degeneracy of the genetic code.

Portions of the Erwinia carotovora genes or corresponding genes from other organisms are included provided that such portions can function in carbapenem synthesis.

We also provide a polypeptide coded for by DNA as defined hereinabove.

We further provide a recombinant DNA molecule including any DNA as defined hereinabove.

We also provide a recombinant DNA molecule, including the cluster of genes carA,carB,carC,carD, carE,carF,carG,carH. Such a recombinant DNA molecule may also contain either or both of the regulatory genes carR and carI, or may be placed under the control of a heterologous regulated promoter. The recombinant DNA molecule is preferably an expression vector capable of expressing a carbapenem coded for by said gene cluster.

In such a recombinant DNA molecule, any of the above-mentioned genes may be replaced by DNA capable of hybridizing thereto and which is itself capable of functioning in the carbapenem biosynthetic pathway.

Any of the above-mentioned genes may be replaced by DNA which is equivalent to it by virtue of the degeneracy of the genetic code.

Examples of recombinant DNA molecules as mentioned above are the cosmid cWU142 and plasmids pSMG12 and pSMG13.

We also provide a method of identifying further DNA coding for production of a carbapenem by using a DNA sequence as defined hereinabove, and preferably at least a substantial part of one or more of the genes carR carA,carB,carC,carD, caE,carF,carG,carH as a hybridization probe, incorporating any further DNA so identified into an expression vector, transforming a host with the resulting recombinant expression vector, expressing the polypeptide coded for by said further DNA, and testing said polypeptide for carbapenem biosynthetic activity.

The known carbapenem genes or portions thereof may thus be used to search for homologues in other bacteria as well as other organisms such as fungi. The procedure is particularly appropriate for searching for carbapenem gene homologues in streptomycetes producing Thienamycin and other carbapenems. The portions of the above-mentioned genes to be used as hybridization probes may if necessary be modified by replacing codons by codons coding for the same amino acids but more appropriate to the organism under investigation.

The Erwinia carotovora strain used throughout was Erwinia carotovora subspecies carotovora (ATCC39048).

An internal StuI fragment from the car cluster of the strain of Erwinia cartovora described above (containing DNA internal to carA through to a site internal to carG) was used to probe chromosomal DNA from various other Erwinia carotovora strains. Of 16 strains probed, 10 yielded positive hybridisation signals. The StuI probe was also used to probe DNA from other Erwinia carotovora subspecies, and bacteria of other genera, namely Serratia marcescens and Streptomyces cattleya. Positive signals were seen with Erwinia carotovora subspecies atroseptica (strain SCRI39), Erwinia carotovora subspecies betavasculorum (strain SCRI479), Serratia marcescens (ATCC39006) and Streptomyces cattleya (strain 8057). We therefore believe that those strains showing positive hydridisation signals carry genes homologous to the car genes of our original strain (ATCC39048). In fact we have established that Serratia marcescens (ATCC39006) produces exactly the same antibiotic as the original Erwinia carotovora strain. We have determined the sequence of the complete Car cluster from Serratia marcescens (ATCC39006) and all of the genes are highly homologous to the corresponding genes of Erwinia carotovora (ATCC39048).

We have also performed PCR amplificiation of carC homologues in Streptomyces cattleya. The rationale for this is that antibiotic synthesis (and resistance) genes are usually clustered in bacteria and so cloning part of any one gene in the Streptomyces cluster then provides a "probe" with which to clone the rest of the cluster.

In addition to knowing the total car cluster sequence from Erwinia carotovora, we also cloned and sequenced the homologous sequences from Serratia marcescens (strain ATCC39006). As with the Erwinia cluster, the equivalent car cluster from Serratia also encodes carbapenem synthesis when cloned in E. coli. Two 18mer oligonucleotide primers were designed based on totally conserved amino acids in the predicted protein sequence of the Erwinia carotovora and Serratia marcescens carC genes (strep-C1 and strep-C2).

Oligonucleotide Primer Sequences

Strep-C1 CCG C/GGA CAT CTA CTC C/GGC (SEQ. ID NO:11)

Strep-C2 CCG TCC TCC CAG AAG TGC (SEQ. ID NO:12)

The two conserved regions chosen spanned DNA encoding 185 amino acids of the predicted Erwinia gene product. In order to bias the primers for use with Streptomyces DNA, which has a high ratio of G and C residues, the amino acid sequence was back translated using G and C residues where possible. This resulted in the oligonucleotides being of mixed sequence where the amino acid could be encoded by more than one DNA sequence.

PCR reactions were performed using the above primers. A number of sets of reactions were performed with varying concentrations of template DNA, primer DNA, dNTPs and magnesium chloride. The denaturing time, annealing temperature and ramp rate of the PCR programme were also varied.

A PCR product of approximately 500 base pairs was seen in some of the reactions using the primers strep-Cl and strep-C2 and the Streptomyces cattleya DNA as template.

The carA to carH gene products may be used to make antibodies for use in identifying in a library streptomycete clones expressing a cross-reacting protein. This is a possible route to the identification of new car genes and new carbapenem antibiotics.

EXAMPLES

The cloning of the carbapenem biosynthetic genes will now be described in more detail and with reference to the accompanying figures, by way of example only:

FIG. 1 shows schematically the cosmid cWU142, showing the organisation of the car genes R,A,B,C,D,E,F,G,H, as well as unrelated genes (X,T,R') and the three PstI restriction sites. Cosmid cWU142 was obtained by cloning the carbapenem biosynthetic genes into the low copy number plasmid pSF6.

cWU142 was digested with PstI, to yield three large (ie greater than 12 kb) fragments. Each of these three fragments was cloned individually in pUC9. The clone that was shown by PCR analysis to contain carR was named, pSMG10. The fragment within this construct was subsequently recloned using PstI in pACYC177 and this construct was named pSMG12 (FIG. 2).

The carI gene had previously been cloned in pSJ10 and been shown to be present on a 2.3 kb PstI fragment (S. Jones). This carI containing fragment was cloned into pSMG12 following partial digestion of this plasmid with PstI. The resulting construct was named pSMG13 (FIG. 3).

E. coli containing plasmids pSMG12 or pSMG13 was spotted onto a lawn of the carbapenem hyper-sensitive strain of E. coli ("Beecham's E. coli"). A zone of clearing around the pSMG13-containing strain indicated the production by this strain of carbapenem. When exogenous N-(3-oxohexanoyl)-L-homoserine lactone (OHHL) was added to both strains, then the pSMG12 carrying strain could also produce carbapenem.

FIG. 4 shows the entire sequence of the carR,carA,carB,carC,carD,carE,carF,carG,carH genes, together with the sequences of the polypeptides that they encode. The sequences of some upstream and downstream DNA are also shown, including a portion of the orfX gene which, like the adjacent kdgT and R¹ -genes do not form part of the carbapenem biosynthetic pathway.

From complementation and cross feeding studies, involving the mutants and cosmids of both Serratia marcescens (Sma) and Erwinia carotovora subspecies cartovora (Ecc) it appeared that all the genes necessary for carbapenem biosynthesis were also contained on 2 Sma cosmids, pNRT1 and pNRT20. These Sma carbapenem cosmids seemed to be functionally interchangeable in many cases with the Ecc cosmid cWU142, and it seemed likely that putative carbapenem genes might be quite highly conserved between the two genera. Southern blot hybridisations were used to localise the carbapenem gene region on the cosmids and this facilitated the subcloning and sequencing of the Sma carbapenem genes. 8.7 kb from the Sma carbapenem gene region was sequenced and was found to contain 9 ORFs, carRABCDEFGH, all transcribed in the same direction, with carABCDEFGH possibly forming an operon. The 9 genes had both similar size and high DNA homology to the 9 genes sequenced from the carbapenem gene region of Ecc. The predicted gene products also showed high amino acid sequence homology with the corresponding gene products from Ecc.

    __________________________________________________________________________     SEQUENCE LISTING     (1) GENERAL INFORMATION:     (iii) NUMBER OF SEQUENCES: 12     (2) INFORMATION FOR SEQ ID NO: 1:     (i) SEQUENCE CHARACTERISTICS:     (A) LENGTH: 8341 base pairs     (B) TYPE: nucleic acid     (C) STRANDEDNESS: single     (D) TOPOLOGY: linear     (ii) MOLECULE TYPE: DNA (genomic)     (iii) HYPOTHETICAL: NO     (iv) ANTI-SENSE: NO     (vi) ORIGINAL SOURCE:     (A) ORGANISM: Erwinia carotovora     (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:     AGATCGGCGCACTGCGTGCCTTTGGCGACGCCGCGTTACGGCATTTTCACGCTGATTTGG60     AGCGGGCTGATGTCGCTGGTGAGCGTCGGCGTGCTGCTGGGTGTCGGTCTGGGCTACCTT120     GCCGCCCGCGCGATTGCGGTGGTGATGAGTGAGAAAAGCGGTTTTGTCCTGCCGGTGACG180     TTGGAATGGGAAGACATCCACTTCGTCCTGCTCCTGTTGCTGGTCGCCGCTGTTGTCCTG240     ACGATTCCGGCGATGCTGTCCTACCGTCAATCTCCTGCTACGGCGCTGCGTGGCGAGTAA300     TAGGGTTGCGACTCAGTCGCAAAACTAAGCTGTAGTGGCATCGGACGTGTAGTTCTCTGC360     GTCCGATGCTGTTTTTTCCTTACCTATTCATCCCTTCCGATTCTCATACTCACTCAACGG420     CCTTCGTCCTTCGGTATTATGGTTTGGGGCCAGTAACGAGCAGTAATACTGTTACTTATT480     TCGCTAAATAATATTTATAATAATAGTAACTCTATCAATTATTATTGAATTATTAGTATA540     TAAAATGTTGACTGATTGTATATTAATTGATAACTTTCGACCTTGTTAAATCCTAGTGAT600     TATATTTGGTGTAACTAAAAATAATTATATTTACTCTCTGGTAAAGAGTTGATCTTTTAA660     TCTTTGAGCAAAGTCGGTAAGAGAGGGTAATATGGATCATGAAATCCATTCCTTTATCAA720     AAGGAAGTTGAAAGGAGTCGGTGATGTATGGTTTTCTTATTTTATGATGAGTAAAAACTC780     TACCAGCCAACCTTATATTATTTCGAATTATCCAGAAGCATGGATGAAGGAGTATATAAA840     AAAAGAGATGTTTCTGAGTGATCCTATCATTGTTGCCTCATTAGCTCGGATCACGCCGTT900     TTCTTGGGATGATAATGATATTGTGACGCTAAGAGCCAAGAATCAGGATGTCTTTATTTC960     TTCCGTGCAGCACGATATAAGTTCAGGTTATACCTTTGTTTTGCACGACCATGATAATAA1020     TGTGGCGACACTGAGTATAGCGAATCACTTGGAAGATGCGAATTTCGAAAAATGTATGAA1080     GAATCATGAAAATGATTTGCAGATGTTACTTGTGAATGTACATGAAAAAGTGATGGCCTA1140     TCAGCGAGCTATCAACGATCAAGATAACCCCCCCGATAATTCAAGAAACGCCTTACTCTC1200     TCCGCGTGAAACCGAAGTTCTTTTCCTGGTTAGTAGTGGACGAACTTACAAAGAGGTTTC1260     CCGTATATTAGGTATTAGTGAGGTCACCGTTAAGTTCCACATTAACAACTCAGTCCGTAA1320     ATTGGATGTTATCAATTCCCGCCATGCTATAACTAAAGCACTTGAGTTAAATCTTTTCCA1380     TTCCCCCTGTGAACCTGTAGTGATGAAGCATATGGACGCCCGTTAGCGTGTATTAATGCT1440     TGATAATAGGGAGGTACCTATCCAAAAGAATAGTAACTCCCTATCTAAAAGAAATACATT1500     AGCTGAACTTTTACACGGTTAATATTTACCCTCGCCTTTTCTATGATATGTGACGAATTC1560     AAACGTTGTTTTCTCTCATGTTGTTATTAAGGTAAGGGTTATTACTGTGAGCAATAGTTT1620     TTGCGTTGTTTATAAAGGTTCTGATACCGATATAAATAATATCCAACGCGACTTCGACGG1680     AAAGGGCGAACGATTATCTAATGGCTATCTTTTTATCGAACAGAATGGCCATTATCAGAA1740     GTGTGAGATGGAAAGAGGAACGGCCTACCTGATAGGCTCGCTGTACAATCGGACGTTTCT1800     GATCGGATTGGCCGGTGTGTGGGAAGGCGAGGCTTATCTGGCAAATGATGCCGAGCTGTT1860     AGCGTTGCTGTTCACGCGTTTGGGAGCGAATGCACTGGCGCTGGCTGAAGGTGCATTCTG1920     CTTTTCTTATGAGCCAAACGGCGAATTGACGGTGATTACCGAGTCGCGTGGTTTCTCGCC1980     GGTTCATGTCGTACAGGGCAAAAAAGCCTGGATGACCAATAGCCTTAAACTGGTTACTGC2040     GGCAGAAGGTGAAGGCGCGCTGTGGTTTGAAGAAGAGGCGTTGGTGTGCCAGTCGCTGAT2100     GCGAGCGGATACCTATACGCCGGTGAAAAATGCGCAGCGTCTTAAGCCGGGAGCGGTGCA2160     TGTTCTTACGCACGATAGCGAAGGTTATTCTTTCGTTGAAAGCCGCACGCTGACCACACC2220     AGCCAGCAACCAATTGTTAGCGCTCCCGCGTGAACCGCTGCTGGCATTGATTGATCGCTA2280     CCTTAATGCTCCGCTTGAGGATTTAGCGCCGCGCTTTGATACCGTAGGAATTCCCTTGTC2340     AGGCGGTCTGGATTCCAGCCTGGTAACGGCGCTCGCCAGTCGTCATTTCAAAAAATTGAA2400     TACGTATTCGATTGGTACGGAACTCAGCAATGAGTTTGAGTTTTCTCAACAGGTTGCTGA2460     TGCACTCGGTACACATCATCAGATGAAAATTCTGTCCGAAACTGAAGTGATCAACGGCAT2520     CATCGAATCCATCTATTACAACGAAATATTTGACGGCTTATCCGCTGAAATCCAATCCGG2580     GTTGTTCAATGTCTATCGTCAGGCTCAGGGGCAGGTGTCTTGCATGCTCACCGGATATGG2640     TTCCGACCTGCTCTTTGGCGGCATACTGAAACCAGGAGCGCAGTATGACAATCCGAATCA2700     GCTGCTTGCCGAGCAAGTGTACCGGACGCGTTGGACAGGGGAGTTTGCTACCCACGGTGC2760     TTCCTGTTACGGCATTGATATTCGCCACCCCTTCTGGAGCCATTCCCTAATCTCTCTATG2820     TCATGCGCTACATCCTGATTACAAAATTTTCGACAACGAAGTCAAAAACATCCTGCGTGA2880     ATACGCCGATTCGCTGCAATTGCTGCCGAAAGACATTGTCTGGCGCAAGAAAATCGGCAT2940     TCACGAAGGTTCCTCCGTCAATCAGGCCTTTGCGAATGTTCTCGGGTCAACGGTTGATAA3000     CTACCAGACCAAAAGTCGCTTTACCTACCGTGTTTATCAAGCCTTCCTTCGTGGCCGTCT3060     CTCCATTACAGATGTGACGCCCTCTCAGCTTAAAGATCTGATTAAAAAGGATTAATTATG3120     GTTTTTGAAGAGAATTCCGATGAGGTCCGAGTGATCACGCTCGATCATCCGAACAAGCAT3180     AACCCCTTTAGTCGAACGCTGGAAACCAGCGTGAAGGACGCGCTGGCGCGAGCCAACGCT3240     GACGACAGCGTACGAGCTGTCGTGGTGTATGGCGGGGCCGAGCGTTCCTTTTCCGCTGGC3300     GGAGATTTCAACGAAGTTAAGCAGCTATCGCGCAGCGAGGACATCGAAGAGTGGATTGAC3360     CGCGTGATTGATTTGTATCAGGCCGTCCTGAATGTGAATAAACCGACGATCGCAGCGGTG3420     GATGGCTATGCGATTGGTATGGGTTTCCAGTTCGCCCTGATGTTTGACCAGAGACTGATG3480     GCGTCGACAGCAAATTTTGTGATGCCGGAACTTAAGCATGGTATCGGGTGCTCGGTTGGG3540     GCGGCCATTCTGGGATTCACCCACGGATTTAGCACAATGCAGGAAATCATCTACCAGTGC3600     CAGTCGCTCGATGCTCCACGCTGTGTGGACTACCGGCTGGTTAATCAGGTGGTGGAGAGC3660     AGTGCGCTGCTGGATGCCGCTATCACACAGGCGCACGTGATGGCTAGCTACCCGGCTTCT3720     GCATTCATCAATACGAAACGAGCGGTTAACAAACCGTTCATCCATCTACTGGAACAAACC3780     CGTGACGCTTCCAAAGCTGTCCATAAGGCAGCGTTCCAGGCTCGTGACGCTCAGGGACAT3840     TTCAAAAATGTGCTTGGCAAAAAATACTGAGGATAAGGAAGAAAATGAGCGAAATAGTGA3900     AGTTTAATCCGGTCATGGCATCCGGTTTTGGCGCGTATATCGATCATCGGGACTTTCTCG3960     AAGCCAAGACAGAAACGATTAAAAACTTACTGATGCGTCAGGGATTTGTCGTGGTCAAAA4020     ATCTCGATATTGATAGTGACACATTCCGCGATATCTACTCGGCTTACGGTACGATCGTGG4080     AGTATGCGGATGAAAAGATCGGCGTAGGTTTCGGCTATCGCGATACGTTGAAGCTGGAAG4140     GGGAAAAAGGAAAAATCGTTACCGGACGTGGTCAACTTCCTTTCCACGCTGATGGCGGCC4200     TGCTGCTGTCACAGGTAGATCAGGTTTTCCTCTACGCGGCCGAGATTAAAAACGTCAAAT4260     TCCGTGGTGCAACAACGGTATGCGACCATGCTCTGGCTTGTCAGGAAATGCCGGCTCACC4320     TTCTGCGCGTACTGGAAGAGGAAACGTTCGAAGTTCGTGTGCTAGAGCGGGGCTATTACG4380     TGGATGTTTCACCAGACGGTTGGTTCAAGGTGCCGGTCTTCACCGATCTGGGATGGGTCA4440     GAAAGATGCTGATTTATTTCCCATTTGACGAAGGACAACCGGCTAGCTGGGAGCCGCGGA4500     TTGTCGGTTTCACCGATCATGAAACCCAGGCATTCTTCCAGGAACTCGGGGCGTTTTTAA4560     AACAGCCACGCTACTACTACAAACACTTCTGGGAAGATGGTGACCTGCTGATTATGGACA4620     ACCGTCGTGTCATTCATGAGCGTGAAGAGTTTAACGATGACGACATCGTACGTCGTCTGT4680     ATCGCGGACAAACCGCCGATATCTAACTCTCATGATTCCGGCGCGAAAACGTGCCGGATT4740     GATATGGATCTGGAACCATCATGCCTAACGATCTTTACGCTATCTACAATCGTTACACGT4800     CAAGGACCCTGTTTTTTAAATACTGTGCGACAGCGACGCTAACGCACAGACTGACACGTC4860     GGCTGTCGCTTTTTACGTTGAAAAAGTGCCTTGCCCGTCCACAGGGACGCCTTTTCTCTC4920     TGGTTAATAGCATTTATTTTGGCGGAGAAACGCTGGAAGAGGTTCAGAGTACAGCAACTT4980     TTCTGGCACGCTCTGGCATTGCTTGCGTACTGGACTACGCCGTGGAAGGGGAAAATGACG5040     AGACGCAGTTCGATAAGGCAATGGAAAATACGCTGCGCCTTATCGAGATGTCGCAGCAAA5100     CGGATAGCCTGCCTTTTGTCGTGATTAAACCGTCGTCTCTGGGTAGTGTTGCTGTGTATG5160     CTCGGCAAAGTGAAAGGCTCGCGCTGGATGAAGCATCCGCCAGCGCATGGTCACGCATCG5220     TTACGCGTTTCTCACGTCTGTTCGATTATGCCCGCTCTCATGGCGTGCATGTGATGGTTG5280     ATGCAGAGCAGACCGCGATTCAGCCCGCAGTGGATCGTTTGGTTCTAGACATGATGCGTG5340     AATTTAACCGCGATAGCGCGGTGATCACGCTGACGCTGCAATTTTATCTAAAGGATCAAT5400     TGCGTTTTCTCGACGAATGCTATCAGCGAGCCTGTCAGGATAATTTCCTGTTCGGCGTGA5460     AAGTAGTGCGCGGTGCTTATCTGGAAGAAGAGAAACGAGTGAACGGTGGCGTACGCTGTT5520     TTGCAACCAAACAGGAAACCGATCGTAGTTATAACGCGGCGGTGGATTACATTGCACTGC5580     GTCTGGATCGAATCGCACCGTTTTTCGCCACGCACAACGAGGAAAGCCTCGCTTTGATTA5640     TGTCGAGTGAGTCGCTACGCGCGGGGCGCACCTGGGTAGGCCAGCTTTACGGGTTGGGCG5700     ATCATATTACTTATTCGCTGCTGCAAACCGGTTTCCGCGTGTGCAAATACCTGCCTTACG5760     GTCCGCTCGATAAGTCGTTGCCCTATTTACTCCGCAGGATTGAAGAGAACGCGGTTGCCT5820     CGGCAACCTTCAAAAAGGAAAATAAACTCTTGCAGAAGGAGTTGCTGCGCCGTCTCGTAG5880     GAGGGATGTAATGTCAAAATTTCATTGTCATATTCGCGATACGGCTATCGGTTTCCCCGT5940     GTCGGAGGATGAAAGCGTACTCAGTTCAGCCTATGAAGCTGGTGTCGAACTGCCTTATCG6000     CTGTGCTTCTGGTTACTGTGGTGTATGCAAGGTGCGCCTGACGTCCGGCAACGTCAACAT6060     GGATCATTCTGGTGGGATCTCACGTAAAGATATCGCTGATGGTTACATTCTACCTTGCTG6120     TTCTGTGCCGCTCTCTAACCTCGAAATTGAACCTGTATCGTCATGCTGAAGAAAACGCTA6180     ATTGTTGGACTGTGCTGTACTTTCCCCCTCTTCTCCGCTCAAGCGGTGAATACTGTTCCT6240     GACGAGGTGGTTGTCAAAGGTGGCAATTTCTATGTTGGATCGGTCTTCGGCTCGGAAGAC6300     TATGCTGCTCATGCCAACACGTCTATCGCGTCTTTTGCCATCACGAAAACGGAAATTACC6360     TATCGGCAGTATCATGCGTTGCAAGAATGGGCGGACACACACGGTTATGAGCTTAGTGGC6420     GGCTGCAATGGCGCGACTTTCGAAGATTGCTTACCGCCTGAACAGGATAACAGTCTTCAT6480     CCTGTCACGAATGTATCTTGGTGGGATGCAGTGATTTTTGCCAATGTGCTCAGTGAACGG6540     CAGCAGTTGCAGCCCTATTACCTCACGATTGACGGTAAGACGCTGAAACGCGTGCCGGAG6600     GACGATAACGATAAACTCATACGCGAGAACCCGCAGGCTTTGGGGTATCGCTTGCCGACG6660     CTGGCGGAATGGCAAGTGGCAGCCAGAGGCGGAAAGAAGGGGCTGGCGCAGGGGACGTAC6720     GGACAGCGCTATGCCGGTAGCGAACAGCCGGACAGTGTGGCGCATTTTCCTTCTGACTCT6780     CAATCATTCGGCACGGTGCCTGTGACCTCGAAGCGCCCCAATGCTCTGGGGCTTTACGAT6840     ATGAGTGGCAATGTATCCGAGTGGCTGAATGAGTCTTATGCGGTGGAAGGCGGCAAAACC6900     ATGTACTACTTTTGCGGCGGAAGTTATCTGGAACGCACGCACAGTCTTGCTAGCTGCGAT6960     CTGCATACGCCCGGTTTTTTCATGCCGGATATTGGTTTTCGACTAGTAAGGACACTTGAT7020     GGTCAATAAGTTCGTAGGGTGGCTGGCGCTCTGTGCCATTAGCAACACGGCTGCTGCACT7080     CTCTCCTGTGACATTGAAAGACGGCATCAATCGGCTGGATCTCAATCAGGGCGGCGGGAA7140     CGATTATGTCGTTGTTGCCCAATTTGACAACAATACCTCGCATCCTAATCTCGGGATGAC7200     CTTTTTTGTCCGCCGTCCAGACGGTGGGCATAGCATTATGCCGGTAGCCAACAGCAATAC7260     GTTTACCTGGTTCGACTATCGCCTGTCCGCTGCGGCGGATTTTCTGGTGCAGGACAATCG7320     GCTATTCCTGTCTGGAAAGCATTACTTTCTGGTGACGGCACGGAAGCAGGGTGAAAACGT7380     CTTTGATCCCACAAAAGTCGTTTTAACAATTTACGACTTTAAAGCCTCACGGGACGATCC7440     CGGTGTACCACTCTATGAATGGTCGGAGCGCAAGCGTGTGGTAACACAAGATACCTACCA7500     ATCGGTCGATGAGGCCTACAAGGAAGTGAATGAGGCGATGCTGGCAAAATGAAAATATCG7560     GTATTAATCGCATCCGGGCTATTGGCATCTTTCGGGGCGTGGGCGCAACTCAGTGAGCAG7620     GATTACCAGCAACGCGTACAGGAGTTTTTTGATGCAGAACCGCCGCTGTGTCTGGGCGAG7680     AAGCAGTGGCCGGTTCATAGTCCCAAAGGTGATTCTCCCTGGAACAGTGGACGGTTGCAT7740     ACGCTTGTGGAAGCGGGGTTGGCATACGCGACGCCTGAAGGAACAGGAAAGGTATATCGG7800     TTATCGCCTGTAGGTGAGAAGAATTGGCGTCAACATGGCGACCTTTGCTATGGCCGTATG7860     CAAGTGAGCCGTATTGAGAAAATCGATCGTGTGAACCAGGAACTGACGGTAGTGTATTTC7920     ACTTATCATCTGACATCGTTGGAAAGTTGGGCTCATAACCGTTCGTTACGCTTTGCTTTT7980     AGCGAGTTGGATAATCTTGTCGGTGGAATGGAAACCACGCGTTATTCCGCTACGATTCGT8040     GAGACGTTGGGCGGGGCGGCCAAATTGCAGGATTACCCCGTGCCGGTAGAGCTGGATTAC8100     TAATCTGAAATCTTATTGACGTGACATAAAAAAAACAGGGTGAACGTGGTTCACCCTGCT8160     GCATTTTACTGTCTTGGCGTGACGCCAGAGGGGATTAGCGTAAATCCAGCATCGCGATAT8220     GGTCCATATCGTCAAACGTTAGGTTTTCGCCGATCATGCCCCAGATAAAGGCATAGTTTT8280     TGGTGCCGACACCGGTGTGAATCGACCAACTCGGGGAGATCACGGCCTGCTCGTTATGCA8340     T8341     (2) INFORMATION FOR SEQ ID NO: 2:     (i) SEQUENCE CHARACTERISTICS:     (A) LENGTH: 244 amino acids     (B) TYPE: amino acid     (C) STRANDEDNESS:     (D) TOPOLOGY: linear     (ii) MOLECULE TYPE: protein     (iii) HYPOTHETICAL: NO     (iv) ANTI-SENSE: NO     (vi) ORIGINAL SOURCE:     (A) ORGANISM: Erwinia carotovora     (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:     MetAspHisGluIleHisSerPheIleLysArgLysLeuLysGlyVal     151015     GlyAspValTrpPheSerTyrPheMetMetSerLysAsnSerThrSer     202530     GlnProTyrIleIleSerAsnTyrProGluAlaTrpMetLysGluTyr     354045     IleLysLysGluMetPheLeuSerAspProIleIleValAlaSerLeu     505560     AlaArgIleThrProPheSerTrpAspAspAsnAspIleValThrLeu     65707580     ArgAlaLysAsnGlnAspValPheIleSerSerValGlnHisAspIle     859095     SerSerGlyTyrThrPheValLeuHisAspHisAspAsnAsnValAla     100105110     ThrLeuSerIleAlaAsnHisLeuGluAspAlaAsnPheGluLysCys     115120125     MetLysAsnHisGluAsnAspLeuGlnMetLeuLeuValAsnValHis     130135140     GluLysValMetAlaTyrGlnArgAlaIleAsnAspGlnAspAsnPro     145150155160     ProAspAsnSerArgAsnAlaLeuLeuSerProArgGluThrGluVal     165170175     LeuPheLeuValSerSerGlyArgThrTyrLysGluValSerArgIle     180185190     LeuGlyIleSerGluValThrValLysPheHisIleAsnAsnSerVal     195200205     ArgLysLeuAspValIleAsnSerArgHisAlaIleThrLysAlaLeu     210215220     GluLeuAsnLeuPheHisSerProCysGluProValValMetLysHis     225230235240     MetAspAlaArg     (2) INFORMATION FOR SEQ ID NO: 3:     (i) SEQUENCE CHARACTERISTICS:     (A) LENGTH: 521 amino acids     (B) TYPE: amino acid     (C) STRANDEDNESS:     (D) TOPOLOGY: linear     (ii) MOLECULE TYPE: protein     (iii) HYPOTHETICAL: NO     (vi) ORIGINAL SOURCE:     (A) ORGANISM: Erwinia carotovora     (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:     ValThrAsnSerAsnValValPheSerHisValValIleLysValArg     151015     ValIleThrValSerAsnSerPheCysValValTyrLysGlySerAsp     202530     ThrAspIleAsnAsnIleGlnArgAspPheAspGlyLysGlyGluArg     354045     LeuSerAsnGlyTyrLeuPheIleGluGlnAsnGlyHisTyrGlnLys     505560     CysGluMetGluArgGlyThrAlaTyrLeuIleGlySerLeuTyrAsn     65707580     ArgThrPheLeuIleGlyLeuAlaGlyValTrpGluGlyGluAlaTyr     859095     LeuAlaAsnAspAlaGluLeuLeuAlaLeuLeuPheThrArgLeuGly     100105110     AlaAsnAlaLeuAlaLeuAlaGluGlyAlaPheCysPheSerTyrGlu     115120125     ProAsnGlyGluLeuThrValIleThrGluSerArgGlyPheSerPro     130135140     ValHisValValGlnGlyLysLysAlaTrpMetThrAsnSerLeuLys     145150155160     LeuValThrAlaAlaGluGlyGluGlyAlaLeuTrpPheGluGluGlu     165170175     AlaLeuValCysGlnSerLeuMetArgAlaAspThrTyrThrProVal     180185190     LysAsnAlaGlnArgLeuLysProGlyAlaValHisValLeuThrHis     195200205     AspSerGluGlyTyrSerPheValGluSerArgThrLeuThrThrPro     210215220     AlaSerAsnGlnLeuLeuAlaLeuProArgGluProLeuLeuAlaLeu     225230235240     IleAspArgTyrLeuAsnAlaProLeuGluAspLeuAlaProArgPhe     245250255     AspThrValGlyIleProLeuSerGlyGlyLeuAspSerSerLeuVal     260265270     ThrAlaLeuAlaSerArgHisPheLysLysLeuAsnThrTyrSerIle     275280285     GlyThrGluLeuSerAsnGluPheGluPheSerGlnGlnValAlaAsp     290295300     AlaLeuGlyThrHisHisGlnMetLysIleLeuSerGluThrGluVal     305310315320     IleAsnGlyIleIleGluSerIleTyrTyrAsnGluIlePheAspGly     325330335     LeuSerAlaGluIleGlnSerGlyLeuPheAsnValTyrArgGlnAla     340345350     GlnGlyGlnValSerCysMetLeuThrGlyTyrGlySerAspLeuLeu     355360365     PheGlyGlyIleLeuLysProGlyAlaGlnTyrAspAsnProAsnGln     370375380     LeuLeuAlaGluGlnValTyrArgThrArgTrpThrGlyGluPheAla     385390395400     ThrHisGlyAlaSerCysTyrGlyIleAspIleArgHisProPheTrp     405410415     SerHisSerLeuIleSerLeuCysHisAlaLeuHisProAspTyrLys     420425430     IlePheAspAsnGluValLysAsnIleLeuArgGluTyrAlaAspSer     435440445     LeuGlnLeuLeuProLysAspIleValTrpArgLysLysIleGlyIle     450455460     HisGluGlySerSerValAsnGlnAlaPheAlaAsnValLeuGlySer     465470475480     ThrValAspAsnTyrGlnThrLysSerArgPheThrTyrArgValTyr     485490495     GlnAlaPheLeuArgGlyArgLeuSerIleThrAspValThrProSer     500505510     GlnLeuLysAspLeuIleLysLysAsp     515520     (2) INFORMATION FOR SEQ ID NO: 4:     (i) SEQUENCE CHARACTERISTICS:     (A) LENGTH: 250 amino acids     (B) TYPE: amino acid     (C) STRANDEDNESS:     (D) TOPOLOGY: linear     (ii) MOLECULE TYPE: protein     (iii) HYPOTHETICAL: NO     (vi) ORIGINAL SOURCE:     (A) ORGANISM: Erwinia carotovora     (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:     MetValPheGluGluAsnSerAspGluValArgValIleThrLeuAsp     151015     HisProAsnLysHisAsnProPheSerArgThrLeuGluThrSerVal     202530     LysAspAlaLeuAlaArgAlaAsnAlaAspAspSerValArgAlaVal     354045     ValValTyrGlyGlyAlaGluArgSerPheSerAlaGlyGlyAspPhe     505560     AsnGluValLysGlnLeuSerArgSerGluAspIleGluGluTrpIle     65707580     AspArgValIleAspLeuTyrGlnAlaValLeuAsnValAsnLysPro     859095     ThrIleAlaAlaValAspGlyTyrAlaIleGlyMetGlyPheGlnPhe     100105110     AlaLeuMetPheAspGlnArgLeuMetAlaSerThrAlaAsnPheVal     115120125     MetProGluLeuLysHisGlyIleGlyCysSerValGlyAlaAlaIle     130135140     LeuGlyPheThrHisGlyPheSerThrMetGlnGluIleIleTyrGln     145150155160     CysGlnSerLeuAspAlaProArgCysValAspTyrArgLeuValAsn     165170175     GlnValValGluSerSerAlaLeuLeuAspAlaAlaIleThrGlnAla     180185190     HisValMetAlaSerTyrProAlaSerAlaPheIleAsnThrLysArg     195200205     AlaValAsnLysProPheIleHisLeuLeuGluGlnThrArgAspAla     210215220     SerLysAlaValHisLysAlaAlaPheGlnAlaArgAspAlaGlnGly     225230235240     HisPheLysAsnValLeuGlyLysLysTyr     245250     (2) INFORMATION FOR SEQ ID NO: 5:     (i) SEQUENCE CHARACTERISTICS:     (A) LENGTH: 273 amino acids     (B) TYPE: amino acid     (C) STRANDEDNESS:     (D) TOPOLOGY: linear     (ii) MOLECULE TYPE: protein     (iii) HYPOTHETICAL: NO     (vi) ORIGINAL SOURCE:     (A) ORGANISM: Erwinia carotovora     (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:     MetSerGluIleValLysPheAsnProValMetAlaSerGlyPheGly     151015     AlaTyrIleAspHisArgAspPheLeuGluAlaLysThrGluThrIle     202530     LysAsnLeuLeuMetArgGlnGlyPheValValValLysAsnLeuAsp     354045     IleAspSerAspThrPheArgAspIleTyrSerAlaTyrGlyThrIle     505560     ValGluTyrAlaAspGluLysIleGlyValGlyPheGlyTyrArgAsp     65707580     ThrLeuLysLeuGluGlyGluLysGlyLysIleValThrGlyArgGly     859095     GlnLeuProPheHisAlaAspGlyGlyLeuLeuLeuSerGlnValAsp     100105110     GlnValPheLeuTyrAlaAlaGluIleLysAsnValLysPheArgGly     115120125     AlaThrThrValCysAspHisAlaLeuAlaCysGlnGluMetProAla     130135140     HisLeuLeuArgValLeuGluGluGluThrPheGluValArgValLeu     145150155160     GluArgGlyTyrTyrValAspValSerProAspGlyTrpPheLysVal     165170175     ProValPheThrAspLeuGlyTrpValArgLysMetLeuIleTyrPhe     180185190     ProPheAspGluGlyGlnProAlaSerTrpGluProArgIleValGly     195200205     PheThrAspHisGluThrGlnAlaPhePheGlnGluLeuGlyAlaPhe     210215220     LeuLysGlnProArgTyrTyrTyrLysHisPheTrpGluAspGlyAsp     225230235240     LeuLeuIleMetAspAsnArgArgValIleHisGluArgGluGluPhe     245250255     AsnAspAspAspIleValArgArgLeuTyrArgGlyGlnThrAlaAsp     260265270     Ile     (2) INFORMATION FOR SEQ ID NO: 6:     (i) SEQUENCE CHARACTERISTICS:     (A) LENGTH: 376 amino acids     (B) TYPE: amino acid     (C) STRANDEDNESS:     (D) TOPOLOGY: linear     (ii) MOLECULE TYPE: protein     (iii) HYPOTHETICAL: NO     (vi) ORIGINAL SOURCE:     (A) ORGANISM: Erwinia carotovora     (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:     MetProAsnAspLeuTyrAlaIleTyrAsnArgTyrThrSerArgThr     151015     LeuPhePheLysTyrCysAlaThrAlaThrLeuThrHisArgLeuThr     202530     ArgArgLeuSerLeuPheThrLeuLysLysCysLeuAlaArgProGln     354045     GlyArgLeuPheSerLeuValAsnSerIleTyrPheGlyGlyGluThr     505560     LeuGluGluValGlnSerThrAlaThrPheLeuAlaArgSerGlyIle     65707580     AlaCysValLeuAspTyrAlaValGluGlyGluAsnAspGluThrGln     859095     PheAspLysAlaMetGluAsnThrLeuArgLeuIleGluMetSerGln     100105110     GlnThrAspSerLeuProPheValValIleLysProSerSerLeuGly     115120125     SerValAlaValTyrAlaArgGlnSerGluArgLeuAlaLeuAspGlu     130135140     AlaSerAlaSerAlaTrpSerArgIleValThrArgPheSerArgLeu     145150155160     PheAspTyrAlaArgSerHisGlyValHisValMetValAspAlaGlu     165170175     GlnThrAlaIleGlnProAlaValAspArgLeuValLeuAspMetMet     180185190     ArgGluPheAsnArgAspSerAlaValIleThrLeuThrLeuGlnPhe     195200205     TyrLeuLysAspGlnLeuArgPheLeuAspGluCysTyrGlnArgAla     210215220     CysGlnAspAsnPheLeuPheGlyValLysValValArgGlyAlaTyr     225230235240     LeuGluGluGluLysArgValAsnGlyGlyValArgCysPheAlaThr     245250255     LysGlnGluThrAspArgSerTyrAsnAlaAlaValAspTyrIleAla     260265270     LeuArgLeuAspArgIleAlaProPhePheAlaThrHisAsnGluGlu     275280285     SerLeuAlaLeuIleMetSerSerGluSerLeuArgAlaGlyArgThr     290295300     TrpValGlyGlnLeuTyrGlyLeuGlyAspHisIleThrTyrSerLeu     305310315320     LeuGlnThrGlyPheArgValCysLysTyrLeuProTyrGlyProLeu     325330335     AspLysSerLeuProTyrLeuLeuArgArgIleGluGluAsnAlaVal     340345350     AlaSerAlaThrPheLysLysGluAsnLysLeuLeuGlnLysGluLeu     355360365     LeuArgArgLeuValGlyGlyMet     370375     (2) INFORMATION FOR SEQ ID NO: 7:     (i) SEQUENCE CHARACTERISTICS:     (A) LENGTH: 92 amino acids     (B) TYPE: amino acid     (C) STRANDEDNESS:     (D) TOPOLOGY: linear     (ii) MOLECULE TYPE: protein     (iii) HYPOTHETICAL: NO     (vi) ORIGINAL SOURCE:     (A) ORGANISM: Erwinia carotovora     (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:     MetSerLysPheHisCysHisIleArgAspThrAlaIleGlyPhePro     151015     ValSerGluAspGluSerValLeuSerSerAlaTyrGluAlaGlyVal     202530     GluLeuProTyrArgCysAlaSerGlyTyrCysGlyValCysLysVal     354045     ArgLeuThrSerGlyAsnValAsnMetAspHisSerGlyGlyIleSer     505560     ArgLysAspIleAlaAspGlyTyrIleLeuProCysCysSerValPro     65707580     LeuSerAsnLeuGluIleGluProValSerSerCys     8590     (2) INFORMATION FOR SEQ ID NO: 8:     (i) SEQUENCE CHARACTERISTICS:     (A) LENGTH: 288 amino acids     (B) TYPE: amino acid     (C) STRANDEDNESS:     (D) TOPOLOGY: linear     (ii) MOLECULE TYPE: protein     (iii) HYPOTHETICAL: NO     (vi) ORIGINAL SOURCE:     (A) ORGANISM: Erwinia carotovora     (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:     MetLeuLysLysThrLeuIleValGlyLeuCysCysThrPheProLeu     151015     PheSerAlaGlnAlaValAsnThrValProAspGluValValValLys     202530     GlyGlyAsnPheTyrValGlySerValPheGlySerGluAspTyrAla     354045     AlaHisAlaAsnThrSerIleAlaSerPheAlaIleThrLysThrGlu     505560     IleThrTyrArgGlnTyrHisAlaLeuGlnGluTrpAlaAspThrHis     65707580     GlyTyrGluLeuSerGlyGlyCysAsnGlyAlaThrPheGluAspCys     859095     LeuProProGluGlnAspAsnSerLeuHisProValThrAsnValSer     100105110     TrpTrpAspAlaValIlePheAlaAsnValLeuSerGluArgGlnGln     115120125     LeuGlnProTyrTyrLeuThrIleAspGlyLysThrLeuLysArgVal     130135140     ProGluAspAspAsnAspLysLeuIleArgGluAsnProGlnAlaLeu     145150155160     GlyTyrArgLeuProThrLeuAlaGluTrpGlnValAlaAlaArgGly     165170175     GlyLysLysGlyLeuAlaGlnGlyThrTyrGlyGlnArgTyrAlaGly     180185190     SerGluGlnProAspSerValAlaHisPheProSerAspSerGlnSer     195200205     PheGlyThrValProValThrSerLysArgProAsnAlaLeuGlyLeu     210215220     TyrAspMetSerGlyAsnValSerGluTrpLeuAsnGluSerTyrAla     225230235240     ValGluGlyGlyLysThrMetTyrTyrPheCysGlyGlySerTyrLeu     245250255     GluArgThrHisSerLeuAlaSerCysAspLeuHisThrProGlyPhe     260265270     PheMetProAspIleGlyPheArgLeuValArgThrLeuAspGlyGln     275280285     (2) INFORMATION FOR SEQ ID NO: 9:     (i) SEQUENCE CHARACTERISTICS:     (A) LENGTH: 177 amino acids     (B) TYPE: amino acid     (C) STRANDEDNESS:     (D) TOPOLOGY: linear     (ii) MOLECULE TYPE: protein     (iii) HYPOTHETICAL: NO     (vi) ORIGINAL SOURCE:     (A) ORGANISM: Erwinia carotovora     (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:     MetValAsnLysPheValGlyTrpLeuAlaLeuCysAlaIleSerAsn     151015     ThrAlaAlaAlaLeuSerProValThrLeuLysAspGlyIleAsnArg     202530     LeuAspLeuAsnGlnGlyGlyGlyAsnAspTyrValValValAlaGln     354045     PheAspAsnAsnThrSerHisProAsnLeuGlyMetThrPhePheVal     505560     ArgArgProAspGlyGlyHisSerIleMetProValAlaAsnSerAsn     65707580     ThrPheThrTrpPheAspTyrArgLeuSerAlaAlaAlaAspPheLeu     859095     ValGlnAspAsnArgLeuPheLeuSerGlyLysHisTyrPheLeuVal     100105110     ThrAlaArgLysGlnGlyGluAsnValPheAspProThrLysValVal     115120125     LeuThrIleTyrAspPheLysAlaSerArgAspAspProGlyValPro     130135140     LeuTyrGluTrpSerGluArgLysArgValValThrGlnAspThrTyr     145150155160     GlnSerValAspGluAlaTyrLysGluValAsnGluAlaMetLeuAla     165170175     Lys     (2) INFORMATION FOR SEQ ID NO: 10:     (i) SEQUENCE CHARACTERISTICS:     (A) LENGTH: 184 amino acids     (B) TYPE: amino acid     (C) STRANDEDNESS:     (D) TOPOLOGY: linear     (ii) MOLECULE TYPE: protein     (iii) HYPOTHETICAL: NO     (vi) ORIGINAL SOURCE:     (A) ORGANISM: Erwinia carotovora     (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10:     MetLysIleSerValLeuIleAlaSerGlyLeuLeuAlaSerPheGly     151015     AlaTrpAlaGlnLeuSerGluGlnAspTyrGlnGlnArgValGlnGlu     202530     PhePheAspAlaGluProProLeuCysLeuGlyGluLysGlnTrpPro     354045     ValHisSerProLysGlyAspSerProTrpAsnSerGlyArgLeuHis     505560     ThrLeuValGluAlaGlyLeuAlaTyrAlaThrProGluGlyThrGly     65707580     LysValTyrArgLeuSerProValGlyGluLysAsnTrpArgGlnHis     859095     GlyAspLeuCysTyrGlyArgMetGlnValSerArgIleGluLysIle     100105110     AspArgValAsnGlnGluLeuThrValValTyrPheThrTyrHisLeu     115120125     ThrSerLeuGluSerTrpAlaHisAsnArgSerLeuArgPheAlaPhe     130135140     SerGluLeuAspAsnLeuValGlyGlyMetGluThrThrArgTyrSer     145150155160     AlaThrIleArgGluThrLeuGlyGlyAlaAlaLysLeuGlnAspTyr     165170175     ProValProValGluLeuAspTyr     180     (2) INFORMATION FOR SEQ ID NO: 11:     (i) SEQUENCE CHARACTERISTICS:     (A) LENGTH: 18 base pairs     (B) TYPE: nucleic acid     (C) STRANDEDNESS: single     (D) TOPOLOGY: linear     (ii) MOLECULE TYPE: other nucleic acid     (A) DESCRIPTION: /desc = "OLIGONUCLEOTIDE PRIMER"     (iii) HYPOTHETICAL: NO     (iv) ANTI-SENSE: NO     (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:     CCGSGACATCTACTCSGC18     (2) INFORMATION FOR SEQ ID NO: 12:     (i) SEQUENCE CHARACTERISTICS:     (A) LENGTH: 18 base pairs     (B) TYPE: nucleic acid     (C) STRANDEDNESS: single     (D) TOPOLOGY: linear     (ii) MOLECULE TYPE: other nucleic acid     (A) DESCRIPTION: /desc = "OLIGONUCLEOTIDE PRIMER"     (iii) HYPOTHETICAL: NO     (iv) ANTI-SENSE: NO     (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12:     CCGTCCTCCCAGAAGTGC18     __________________________________________________________________________ 

We claim:
 1. A DNA fragment comprising:(a) at least one of the genes carA, carB, carC, carD, carE, carF, carG, carH, as defined in FIG. 4; (b) DNA which specifically hybridizes to any of the genes defined in (a) and which encodes polypeptides which function in the biosynthetic pathway of a carbapenem; (c) DNA encoding polypeptides having the same amino acid sequence as polypeptides encoded by the DNA defined in either (a) or (b) above, by virtue of the degeneracy of the genetic code.
 2. A DNA fragment comprising:(a) the cluster of at least genes carA, carB, and carC as defined in FIG. 4; (b) DNA which specifically hybridizes to the genes defined in (a) and which encodes polypeptides which function in the biosynthetic pathway of a carbapenem; (c) DNA encoding polypeptides having the same amino acid sequence as polypeptides encoded by the DNA defined in either (a) or (b) above, by virtue of the degeneracy of the genetic code.
 3. A DNA fragment comprising:(a) the cluster of genes carA, carB, carC, carD, and carE as defined in FIG. 4; (b) DNA which specifically hybridizes to the genes defined in (a) and which encodes polypeptides which function in the biosynthetic pathway of a carbapenem; (c) DNA encoding polypeptides having the same amino acid sequence as polypeptides encoded by the DNA defined in either (a) or (b) above, by virtue of the degeneracy of the genetic code.
 4. A DNA fragment according to any one of claims 1, 2 or 3, also containing either or both of the regulatory genes carR, as defined in FIG. 4, and carI.
 5. A DNA fragment as defined in any one of claims 1, 2 or 3, wherein the gene is a gene of Erwinia carotovora.
 6. A DNA fragment as defined in any one of claims 1, 2 or 3, wherein said DNA comprises one or more native genes from a carbapenem encoding organism or partially or wholly synthetic DNA.
 7. A polypeptide coded for by a DNA fragment as defined in claim
 1. 8. A recombinant DNA molecule including a DNA fragment as defined in claim
 1. 9. A recombinant DNA molecule according to claim 8, including the cluster of genes carA, carB, carC, carD, carE, carF, carG, carH.
 10. A recombinant DNA molecule according to claim 8 or 9 which also contains either or both of the genes carI and carR.
 11. A recombinant DNA molecule according to claim 2 which is an expression vector capable of expressing a carbapenem polypeptide coded for by said gene cluster.
 12. A method identifying a DNA encoding a polypeptide functioning in the production of a carbapenem by using a DNA fragment as defined in claim 1 or a portion of such a DNA fragment as a specific hybridization probe, incorporating any DNA so identified into an expression vector, transforming a host cell with the resulting recombinant expression vector, expressing the polypeptide coded for by said DNA, and testing said polypeptide for carbapenem production activity.
 13. A method according to claim 12, wherein said DNA is amplified by means of PCR.
 14. A method for identifying a DNA encoding a polypeptide functioning in the production of a carbapenem by using a DNA fragment as defined in claim 4 or a portion of such a DNA fragment as a specific hybridization probe, incorporating any DNA so identified into an expression vector, transforming a host cell with the resulting recombinant expression vector, expressing the polypeptide coded for by said DNA, and testing said polypeptide for carbapenem production activity.
 15. A DNA identified by the method of claim
 12. 16. A recombinant DNA vector incorporating DNA as defined in claim
 15. 17. A polypeptide coded for by DNA as defined in claim 15 or by a recombinant DNA vector as defined in claim
 16. 